Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 5819079 |
| Missing cells | 30465274 |
| Missing cells (%) | 15.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 GiB |
| Average record size in memory | 264.0 B |
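The summary figures above can be cross-checked with a little arithmetic. The sketch below recomputes the missing-cell percentage from the counts in the table and relates the average record size to the total memory footprint. The assumption that each cell occupies 8 bytes is an inference from the numbers (33 columns × 8 B = 264 B), not something the report states.

```python
# Values copied from the "Dataset statistics" table.
n_vars = 33
n_obs = 5_819_079
missing_cells = 30_465_274
avg_record_bytes = 264.0

# Missing cells as a share of all cells (variables x observations).
total_cells = n_vars * n_obs
missing_pct = 100 * missing_cells / total_cells

# Total memory implied by the average record size, in GiB.
total_gib = avg_record_bytes * n_obs / 2**30

# Assumption: 8 bytes per cell, which would also explain the
# per-column "Memory size" of about 44.4 MiB reported for each variable.
bytes_per_cell = avg_record_bytes / n_vars
column_mib = bytes_per_cell * n_obs / 2**20

print(f"missing cells: {missing_pct:.1f}%")
print(f"total size:    {total_gib:.1f} GiB")
print(f"per column:    {column_mib:.1f} MiB")
```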
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 16 |
| Text | 2 |
| Unsupported | 2 |
| DateTime | 7 |
YEAR has constant value "2015" | Constant |
AIRLINE_DELAY is highly overall correlated with CANCELLED and 1 other field | High correlation |
AIR_SYSTEM_DELAY is highly overall correlated with CANCELLED and 1 other field | High correlation |
AIR_TIME is highly overall correlated with CANCELLED and 3 other fields | High correlation |
ARRIVAL_DELAY is highly overall correlated with CANCELLED and 2 other fields | High correlation |
CANCELLATION_REASON is highly overall correlated with CANCELLED and 2 other fields | High correlation |
CANCELLED is highly overall correlated with AIRLINE_DELAY and 4 other fields | High correlation |
DEPARTURE_DELAY is highly overall correlated with ARRIVAL_DELAY | High correlation |
DISTANCE is highly overall correlated with AIR_TIME and 1 other field | High correlation |
DIVERTED is highly overall correlated with AIRLINE_DELAY and 4 other fields | High correlation |
ELAPSED_TIME is highly overall correlated with AIR_TIME and 1 other field | High correlation |
LATE_AIRCRAFT_DELAY_CAT is highly overall correlated with CANCELLATION_REASON | High correlation |
DIVERTED is highly imbalanced (97.4%) | Imbalance |
CANCELLED is highly imbalanced (88.5%) | Imbalance |
LATE_AIRCRAFT_DELAY_CAT is highly imbalanced (76.3%) | Imbalance |
DEPARTURE_TIME has 86153 (1.5%) missing values | Missing |
DEPARTURE_DELAY has 86153 (1.5%) missing values | Missing |
TAXI_OUT has 89047 (1.5%) missing values | Missing |
WHEELS_OFF has 89047 (1.5%) missing values | Missing |
ELAPSED_TIME has 105071 (1.8%) missing values | Missing |
AIR_TIME has 105071 (1.8%) missing values | Missing |
WHEELS_ON has 92513 (1.6%) missing values | Missing |
TAXI_IN has 92513 (1.6%) missing values | Missing |
ARRIVAL_TIME has 92513 (1.6%) missing values | Missing |
ARRIVAL_DELAY has 105071 (1.8%) missing values | Missing |
CANCELLATION_REASON has 5729195 (98.5%) missing values | Missing |
AIR_SYSTEM_DELAY has 4755640 (81.7%) missing values | Missing |
SECURITY_DELAY has 4755640 (81.7%) missing values | Missing |
AIRLINE_DELAY has 4755640 (81.7%) missing values | Missing |
LATE_AIRCRAFT_DELAY has 4755640 (81.7%) missing values | Missing |
WEATHER_DELAY has 4755640 (81.7%) missing values | Missing |
SECURITY_DELAY is highly skewed (γ₁ = 72.12766122) | Skewed |
ORIGIN_AIRPORT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
DESTINATION_AIRPORT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
DEPARTURE_DELAY has 329360 (5.7%) zeros | Zeros |
ARRIVAL_DELAY has 126213 (2.2%) zeros | Zeros |
AIR_SYSTEM_DELAY has 498613 (8.6%) zeros | Zeros |
SECURITY_DELAY has 1059955 (18.2%) zeros | Zeros |
AIRLINE_DELAY has 493417 (8.5%) zeros | Zeros |
LATE_AIRCRAFT_DELAY has 506486 (8.7%) zeros | Zeros |
WEATHER_DELAY has 998723 (17.2%) zeros | Zeros |
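The imbalance percentages in the alerts are consistent with a score of one minus the normalized Shannon entropy of the class distribution. Whether the profiler computes exactly this is an assumption; the sketch only shows that the formula reproduces the reported figures. The DIVERTED counts come from its frequency table, and the CANCELLED count is assumed to equal the number of rows with a non-missing CANCELLATION_REASON (5819079 − 5729195 = 89884).

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the
    class distribution and k is the number of classes."""
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    # Skip empty classes: their entropy contribution is zero.
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


n_obs = 5_819_079

# DIVERTED: 15187 diverted flights, the rest not diverted.
diverted = imbalance_score([n_obs - 15_187, 15_187])

# CANCELLED: count inferred from CANCELLATION_REASON (assumption).
n_cancelled = n_obs - 5_729_195
cancelled = imbalance_score([n_obs - n_cancelled, n_cancelled])

print(f"DIVERTED:  {100 * diverted:.1f}%")
print(f"CANCELLED: {100 * cancelled:.1f}%")
```

A score near 100% means almost all rows fall into one class, which is why both flags trigger the Imbalance alert.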
Reproduction
| Analysis started | 2023-12-17 09:20:21.842856 |
|---|---|
| Analysis finished | 2023-12-17 09:29:16.567911 |
| Duration | 8 minutes and 54.73 seconds |
| Software version | ydata-profiling v4.6.3 |
| Download configuration | config.json |
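A report of this kind can be regenerated with a few lines of Python. The following is a minimal sketch, not the exact invocation used here: the input file name `flights.csv`, the report title, and the output path are all hypothetical, since the report records only the tool version and timings.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write the HTML report to out_path.

    Imports are kept inside the function so the module can be loaded
    even when pandas or ydata-profiling is not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # low_memory=False makes pandas infer each column's dtype from the
    # whole file instead of chunk by chunk.
    df = pd.read_csv(csv_path, low_memory=False)

    profile = ProfileReport(df, title="Flights profile")  # title is illustrative
    profile.to_file(out_path)


# Hypothetical usage:
# build_report("flights.csv", "flights_profile.html")
```

With roughly 5.8 million rows, a full run took about nine minutes in the analysis recorded above, so profiling a sample first may be worthwhile.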
YEAR
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| 2015 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 23276316 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2015 | 5819079 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 5819079 | 25.0% |
| 0 | 5819079 | 25.0% |
| 1 | 5819079 | 25.0% |
| 5 | 5819079 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23276316 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5819079 | 25.0% |
| 0 | 5819079 | 25.0% |
| 1 | 5819079 | 25.0% |
| 5 | 5819079 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23276316 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 5819079 | 25.0% |
| 0 | 5819079 | 25.0% |
| 1 | 5819079 | 25.0% |
| 5 | 5819079 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23276316 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 5819079 | 25.0% |
| 0 | 5819079 | 25.0% |
| 1 | 5819079 | 25.0% |
| 5 | 5819079 | 25.0% |
MONTH
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5240852 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4051368 |
|---|---|
| Coefficient of variation (CV) | 0.52193323 |
| Kurtosis | -1.1756822 |
| Mean | 6.5240852 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0036838264 |
| Sum | 37964167 |
| Variance | 11.594957 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 7 | 520718 | |
| 8 | 510536 | |
| 3 | 504312 | |
| 6 | 503897 | |
| 5 | 496993 | |
| 10 | 486165 | |
| 4 | 485151 | |
| 12 | 479230 | |
| 1 | 469968 | |
| 11 | 467972 | |
| Other values (2) | 894137 |
| Value | Count | Frequency (%) |
| 1 | 469968 | |
| 2 | 429191 | |
| 3 | 504312 | |
| 4 | 485151 | |
| 5 | 496993 | |
| 6 | 503897 | |
| 7 | 520718 | |
| 8 | 510536 | |
| 9 | 464946 | |
| 10 | 486165 |
| Value | Count | Frequency (%) |
| 12 | 479230 | |
| 11 | 467972 | |
| 10 | 486165 | |
| 9 | 464946 | |
| 8 | 510536 | |
| 7 | 520718 | |
| 6 | 503897 | |
| 5 | 496993 | |
| 4 | 485151 | |
| 3 | 504312 |
DAY
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.704594 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7834251 |
|---|---|
| Coefficient of variation (CV) | 0.55929017 |
| Kurtosis | -1.1892082 |
| Mean | 15.704594 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.0086667031 |
| Sum | 91386273 |
| Variance | 77.148556 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 195986 | 3.4% |
| 16 | 195899 | 3.4% |
| 20 | 195707 | 3.4% |
| 13 | 195089 | 3.4% |
| 9 | 194224 | 3.3% |
| 8 | 193964 | 3.3% |
| 23 | 193560 | 3.3% |
| 19 | 193284 | 3.3% |
| 15 | 192950 | 3.3% |
| 22 | 192725 | 3.3% |
| Other values (21) | 3875691 |
| Value | Count | Frequency (%) |
| 1 | 189477 | |
| 2 | 195986 | |
| 3 | 190007 | |
| 4 | 190893 | |
| 5 | 189766 | |
| 6 | 191232 | |
| 7 | 187598 | |
| 8 | 193964 | |
| 9 | 194224 | |
| 10 | 189288 |
| Value | Count | Frequency (%) |
| 31 | 103812 | |
| 30 | 178771 | |
| 29 | 179441 | |
| 28 | 191401 | |
| 27 | 191920 | |
| 26 | 187387 | |
| 25 | 187317 | |
| 24 | 185017 | |
| 23 | 193560 | |
| 22 | 192725 |
DAY_OF_WEEK
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9269412 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.988845 |
|---|---|
| Coefficient of variation (CV) | 0.50646162 |
| Kurtosis | -1.2117267 |
| Mean | 3.9269412 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.057035635 |
| Sum | 22851181 |
| Variance | 3.9555045 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 872521 | |
| 1 | 865543 | |
| 5 | 862209 | |
| 3 | 855897 | |
| 2 | 844600 | |
| 7 | 817764 | |
| 6 | 700545 |
| Value | Count | Frequency (%) |
| 1 | 865543 | |
| 2 | 844600 | |
| 3 | 855897 | |
| 4 | 872521 | |
| 5 | 862209 | |
| 6 | 700545 | |
| 7 | 817764 |
| Value | Count | Frequency (%) |
| 7 | 817764 | |
| 6 | 700545 | |
| 5 | 862209 | |
| 4 | 872521 | |
| 3 | 855897 | |
| 2 | 844600 | |
| 1 | 865543 |
AIRLINE
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| WN | |
|---|---|
| DL | |
| AA | |
| OO | |
| EV | |
| Other values (9) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 11638158 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AS |
|---|---|
| 2nd row | AA |
| 3rd row | US |
| 4th row | AA |
| 5th row | AS |
Common Values
| Value | Count | Frequency (%) |
| WN | 1261855 | |
| DL | 875881 | |
| AA | 725984 | |
| OO | 588353 | |
| EV | 571977 | |
| UA | 515723 | |
| MQ | 294632 | 5.1% |
| B6 | 267048 | 4.6% |
| US | 198715 | 3.4% |
| AS | 172521 | 3.0% |
| Other values (4) | 346390 | 6.0% |
Length
| Value | Count | Frequency (%) |
| wn | 1261855 | |
| dl | 875881 | |
| aa | 725984 | |
| oo | 588353 | |
| ev | 571977 | |
| ua | 515723 | |
| mq | 294632 | 5.1% |
| b6 | 267048 | 4.6% |
| us | 198715 | 3.4% |
| as | 172521 | 3.0% |
| Other values (4) | 346390 | 6.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2216484 | |
| N | 1379234 | |
| W | 1261855 | |
| O | 1176706 | |
| D | 875881 | 7.5% |
| L | 875881 | 7.5% |
| U | 714438 | 6.1% |
| V | 633880 | 5.4% |
| E | 571977 | 4.9% |
| S | 371236 | 3.2% |
| Other values (9) | 1560586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11280274 | |
| Decimal Number | 357884 | 3.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2216484 | |
| N | 1379234 | |
| W | 1261855 | |
| O | 1176706 | |
| D | 875881 | 7.8% |
| L | 875881 | 7.8% |
| U | 714438 | 6.3% |
| V | 633880 | 5.6% |
| E | 571977 | 5.1% |
| S | 371236 | 3.3% |
| Other values (7) | 1202702 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 267048 | |
| 9 | 90836 | 25.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11280274 | |
| Common | 357884 | 3.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2216484 | |
| N | 1379234 | |
| W | 1261855 | |
| O | 1176706 | |
| D | 875881 | 7.8% |
| L | 875881 | 7.8% |
| U | 714438 | 6.3% |
| V | 633880 | 5.6% |
| E | 571977 | 5.1% |
| S | 371236 | 3.3% |
| Other values (7) | 1202702 |
Common
| Value | Count | Frequency (%) |
| 6 | 267048 | |
| 9 | 90836 | 25.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11638158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 2216484 | |
| N | 1379234 | |
| W | 1261855 | |
| O | 1176706 | |
| D | 875881 | 7.5% |
| L | 875881 | 7.5% |
| U | 714438 | 6.1% |
| V | 633880 | 5.4% |
| E | 571977 | 4.9% |
| S | 371236 | 3.2% |
| Other values (9) | 1560586 |
FLIGHT_NUMBER
Real number (ℝ)
| Distinct | 6952 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2173.0927 |
| Minimum | 1 |
|---|---|
| Maximum | 9855 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 167 |
| Q1 | 730 |
| median | 1690 |
| Q3 | 3230 |
| 95-th percentile | 5565 |
| Maximum | 9855 |
| Range | 9854 |
| Interquartile range (IQR) | 2500 |
Descriptive statistics
| Standard deviation | 1757.064 |
|---|---|
| Coefficient of variation (CV) | 0.80855454 |
| Kurtosis | -0.27916561 |
| Mean | 2173.0927 |
| Median Absolute Deviation (MAD) | 1096 |
| Skewness | 0.85646125 |
| Sum | 1.2645398 × 10¹⁰ |
| Variance | 3087273.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 469 | 3975 | 0.1% |
| 327 | 3554 | 0.1% |
| 326 | 3513 | 0.1% |
| 188 | 3386 | 0.1% |
| 403 | 3370 | 0.1% |
| 667 | 3360 | 0.1% |
| 407 | 3324 | 0.1% |
| 315 | 3321 | 0.1% |
| 223 | 3291 | 0.1% |
| 61 | 3266 | 0.1% |
| Other values (6942) | 5784719 |
| Value | Count | Frequency (%) |
| 1 | 2393 | |
| 2 | 1973 | |
| 3 | 2890 | |
| 4 | 1772 | |
| 5 | 2271 | |
| 6 | 1418 | |
| 7 | 2015 | |
| 8 | 2820 | |
| 9 | 1647 | |
| 10 | 1506 |
| Value | Count | Frequency (%) |
| 9855 | 1 | < 0.1% |
| 9794 | 1 | < 0.1% |
| 9793 | 1 | < 0.1% |
| 9320 | 1 | < 0.1% |
| 8445 | 1 | < 0.1% |
| 8442 | 1 | < 0.1% |
| 8410 | 1 | < 0.1% |
| 8409 | 1 | < 0.1% |
| 7438 | 516 | |
| 7433 | 3 | < 0.1% |
TAIL_NUMBER
Text
| Distinct | 4897 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 14721 |
| Missing (%) | 0.3% |
| Memory size | 44.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.997398 |
| Min length | 5 |
Characters and Unicode
| Total characters | 34811045 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
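The length and character figures for TAIL_NUMBER are tied together by simple arithmetic: the mean length is the total character count divided by the number of non-missing values. The sketch below checks this with the numbers from the tables above. The count of five-character values is an inference that holds only because the minimum and maximum lengths are 5 and 6.

```python
# Values copied from the TAIL_NUMBER tables.
n_obs = 5_819_079
missing = 14_721
total_chars = 34_811_045
max_len = 6

# Only non-missing values contribute characters.
non_missing = n_obs - missing
mean_len = total_chars / non_missing

# With lengths restricted to 5 or 6, every value shorter than the
# maximum is missing exactly one character.
n_short = max_len * non_missing - total_chars

print(f"mean length: {mean_len:.6f}")
print(f"values of length 5: {n_short}")
```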
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | N407AS |
|---|---|
| 2nd row | N3KUAA |
| 3rd row | N171US |
| 4th row | N3HYAA |
| 5th row | N527AS |
| Value | Count | Frequency (%) |
| n480ha | 3768 | 0.1% |
| n488ha | 3723 | 0.1% |
| n484ha | 3723 | 0.1% |
| n493ha | 3585 | 0.1% |
| n478ha | 3577 | 0.1% |
| n483ha | 3528 | 0.1% |
| n486ha | 3513 | 0.1% |
| n491ha | 3494 | 0.1% |
| n489ha | 3477 | 0.1% |
| n477ha | 3402 | 0.1% |
| Other values (4887) | 5768568 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 6830260 | |
| A | 2241035 | 6.4% |
| 3 | 2071106 | 5.9% |
| 9 | 2001378 | 5.7% |
| 6 | 1988220 | 5.7% |
| 7 | 1961545 | 5.6% |
| 1 | 1882399 | 5.4% |
| 5 | 1860773 | 5.3% |
| 4 | 1842975 | 5.3% |
| 2 | 1762716 | 5.1% |
| Other values (24) | 10368638 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18285372 | |
| Uppercase Letter | 16525673 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 6830260 | |
| A | 2241035 | 13.6% |
| W | 1556927 | 9.4% |
| S | 1214594 | 7.3% |
| U | 549084 | 3.3% |
| D | 526467 | 3.2% |
| B | 460822 | 2.8% |
| Q | 349218 | 2.1% |
| M | 334524 | 2.0% |
| K | 330402 | 2.0% |
| Other values (14) | 2132340 | 12.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2071106 | |
| 9 | 2001378 | |
| 6 | 1988220 | |
| 7 | 1961545 | |
| 1 | 1882399 | |
| 5 | 1860773 | |
| 4 | 1842975 | |
| 2 | 1762716 | |
| 8 | 1725865 | |
| 0 | 1188395 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18285372 | |
| Latin | 16525673 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 6830260 | |
| A | 2241035 | 13.6% |
| W | 1556927 | 9.4% |
| S | 1214594 | 7.3% |
| U | 549084 | 3.3% |
| D | 526467 | 3.2% |
| B | 460822 | 2.8% |
| Q | 349218 | 2.1% |
| M | 334524 | 2.0% |
| K | 330402 | 2.0% |
| Other values (14) | 2132340 | 12.9% |
Common
| Value | Count | Frequency (%) |
| 3 | 2071106 | |
| 9 | 2001378 | |
| 6 | 1988220 | |
| 7 | 1961545 | |
| 1 | 1882399 | |
| 5 | 1860773 | |
| 4 | 1842975 | |
| 2 | 1762716 | |
| 8 | 1725865 | |
| 0 | 1188395 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34811045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 6830260 | |
| A | 2241035 | 6.4% |
| 3 | 2071106 | 5.9% |
| 9 | 2001378 | 5.7% |
| 6 | 1988220 | 5.7% |
| 7 | 1961545 | 5.6% |
| 1 | 1882399 | 5.4% |
| 5 | 1860773 | 5.3% |
| 4 | 1842975 | 5.3% |
| 2 | 1762716 | 5.1% |
| Other values (24) | 10368638 |
ORIGIN_AIRPORT
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
DESTINATION_AIRPORT
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| Distinct | 1262 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
DEPARTURE_TIME
Date
MISSING 
| Distinct | 1381 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 86153 |
| Missing (%) | 1.5% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
DEPARTURE_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1217 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 86153 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.3701583 |
| Minimum | -82 |
|---|---|
| Maximum | 1988 |
| Zeros | 329360 |
| Zeros (%) | 5.7% |
| Negative | 3277948 |
| Negative (%) | 56.3% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | -82 |
|---|---|
| 5-th percentile | -9 |
| Q1 | -5 |
| median | -2 |
| Q3 | 7 |
| 95-th percentile | 67 |
| Maximum | 1988 |
| Range | 2070 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 37.080942 |
|---|---|
| Coefficient of variation (CV) | 3.9573443 |
| Kurtosis | 123.00564 |
| Mean | 9.3701583 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 7.5928693 |
| Sum | 53718424 |
| Variance | 1374.9963 |
| Monotonicity | Not monotonic |
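Several of the DEPARTURE_DELAY statistics are derived from others in the same tables, which gives a quick way to sanity-check the report. The sketch below recomputes the interquartile range, range, coefficient of variation, variance, and sum from the reported quantiles, mean, and standard deviation. Small differences against the report are expected because the inputs are rounded.

```python
# Values copied from the DEPARTURE_DELAY tables.
n_obs = 5_819_079
missing = 86_153
mean = 9.3701583
std = 37.080942
q1, q3 = -5, 7
minimum, maximum = -82, 1988

iqr = q3 - q1                         # interquartile range
value_range = maximum - minimum       # full range
cv = std / mean                       # coefficient of variation
variance = std ** 2                   # variance is the squared std
approx_sum = mean * (n_obs - missing) # sum over non-missing rows

print(f"IQR={iqr}, range={value_range}")
print(f"CV={cv:.7f}, variance={variance:.4f}")
print(f"sum≈{approx_sum:.0f}")
```

The large gap between the median (−2) and the mean (about 9.4), together with a CV near 4, reflects the long right tail of delays that the skewness and kurtosis values also indicate.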
| Value | Count | Frequency (%) |
| -3 | 455407 | 7.8% |
| -4 | 444053 | 7.6% |
| -5 | 438844 | 7.5% |
| -2 | 435237 | 7.5% |
| -1 | 387475 | 6.7% |
| 0 | 329360 | 5.7% |
| -6 | 324242 | 5.6% |
| -7 | 242933 | 4.2% |
| -8 | 173407 | 3.0% |
| 1 | 160076 | 2.8% |
| Other values (1207) | 2341892 |
| Value | Count | Frequency (%) |
| -82 | 1 | < 0.1% |
| -68 | 1 | < 0.1% |
| -61 | 1 | < 0.1% |
| -56 | 1 | < 0.1% |
| -55 | 1 | < 0.1% |
| -52 | 1 | < 0.1% |
| -48 | 2 | |
| -47 | 1 | < 0.1% |
| -46 | 1 | < 0.1% |
| -45 | 4 |
| Value | Count | Frequency (%) |
| 1988 | 1 | |
| 1878 | 1 | |
| 1670 | 1 | |
| 1649 | 1 | |
| 1631 | 1 | |
| 1625 | 1 | |
| 1609 | 1 | |
| 1604 | 1 | |
| 1589 | 1 | |
| 1587 | 1 |
TAXI_OUT
Real number (ℝ)
MISSING 
| Distinct | 184 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 89047 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.071662 |
| Minimum | 1 |
|---|---|
| Maximum | 225 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 11 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 31 |
| Maximum | 225 |
| Range | 224 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 8.8955741 |
|---|---|
| Coefficient of variation (CV) | 0.55349434 |
| Kurtosis | 24.002893 |
| Mean | 16.071662 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.4671477 |
| Sum | 92091139 |
| Variance | 79.131238 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 463189 | 8.0% |
| 11 | 462159 | 7.9% |
| 13 | 440243 | 7.6% |
| 10 | 430606 | 7.4% |
| 14 | 402938 | 6.9% |
| 9 | 360368 | 6.2% |
| 15 | 359214 | 6.2% |
| 16 | 312858 | 5.4% |
| 17 | 270827 | 4.7% |
| 8 | 261803 | 4.5% |
| Other values (174) | 1965827 |
| Value | Count | Frequency (%) |
| 1 | 220 | < 0.1% |
| 2 | 353 | < 0.1% |
| 3 | 1716 | < 0.1% |
| 4 | 6141 | 0.1% |
| 5 | 23185 | 0.4% |
| 6 | 75226 | 1.3% |
| 7 | 160802 | 2.8% |
| 8 | 261803 | |
| 9 | 360368 | |
| 10 | 430606 |
| Value | Count | Frequency (%) |
| 225 | 1 | < 0.1% |
| 200 | 1 | < 0.1% |
| 185 | 1 | < 0.1% |
| 181 | 1 | < 0.1% |
| 180 | 1 | < 0.1% |
| 179 | 1 | < 0.1% |
| 178 | 1 | < 0.1% |
| 177 | 4 | |
| 176 | 1 | < 0.1% |
| 175 | 2 |
WHEELS_OFF
Date
MISSING 
| Distinct | 1381 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 89047 |
| Missing (%) | 1.5% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
SCHEDULED_TIME
Text
| Distinct | 550 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 44.4 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 46552584 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 02:05 AM |
|---|---|
| 2nd row | 02:80 AM |
| 3rd row | 02:86 AM |
| 4th row | 02:85 AM |
| 5th row | 02:35 AM |
| Value | Count | Frequency (%) |
| am | 5819073 | |
| 00:85 | 115062 | 1.0% |
| 00:80 | 112856 | 1.0% |
| 00:75 | 105978 | 0.9% |
| 00:90 | 101926 | 0.9% |
| 00:70 | 96823 | 0.8% |
| 00:65 | 91119 | 0.8% |
| 00:95 | 85242 | 0.7% |
| 01:10 | 79296 | 0.7% |
| 01:15 | 74520 | 0.6% |
| Other values (541) | 4956251 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9589032 | 20.6% |
| : | 5819073 | 12.5% |
| (space) | 5819073 | 12.5% |
| A | 5819073 | 12.5% |
| M | 5819073 | 12.5% |
| 1 | 3685602 | 7.9% |
| 5 | 1775417 | 3.8% |
| 2 | 1597071 | 3.4% |
| 3 | 1202243 | 2.6% |
| 7 | 1176189 | 2.5% |
| Other values (4) | 4250738 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23276292 | |
| Uppercase Letter | 11638146 | |
| Other Punctuation | 5819073 | 12.5% |
| Space Separator | 5819073 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9589032 | |
| 1 | 3685602 | 15.8% |
| 5 | 1775417 | 7.6% |
| 2 | 1597071 | 6.9% |
| 3 | 1202243 | 5.2% |
| 7 | 1176189 | 5.1% |
| 8 | 1142696 | 4.9% |
| 6 | 1124045 | 4.8% |
| 9 | 1054456 | 4.5% |
| 4 | 929541 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5819073 | |
| M | 5819073 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 5819073 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5819073 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 34914438 | |
| Latin | 11638146 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9589032 | 27.5% |
| : | 5819073 | 16.7% |
| (space) | 5819073 | 16.7% |
| 1 | 3685602 | 10.6% |
| 5 | 1775417 | 5.1% |
| 2 | 1597071 | 4.6% |
| 3 | 1202243 | 3.4% |
| 7 | 1176189 | 3.4% |
| 8 | 1142696 | 3.3% |
| 6 | 1124045 | 3.2% |
| Other values (2) | 1983997 | 5.7% |
Latin
| Value | Count | Frequency (%) |
| A | 5819073 | |
| M | 5819073 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46552584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9589032 | 20.6% |
| : | 5819073 | 12.5% |
| (space) | 5819073 | 12.5% |
| A | 5819073 | 12.5% |
| M | 5819073 | 12.5% |
| 1 | 3685602 | 7.9% |
| 5 | 1775417 | 3.8% |
| 2 | 1597071 | 3.4% |
| 3 | 1202243 | 2.6% |
| 7 | 1176189 | 2.5% |
| Other values (4) | 4250738 |
ELAPSED_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 712 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 105071 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.00619 |
| Minimum | 14 |
|---|---|
| Maximum | 766 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 82 |
| median | 118 |
| Q3 | 168 |
| 95-th percentile | 299 |
| Maximum | 766 |
| Range | 752 |
| Interquartile range (IQR) | 86 |
Descriptive statistics
| Standard deviation | 74.211072 |
|---|---|
| Coefficient of variation (CV) | 0.54166218 |
| Kurtosis | 2.0543165 |
| Mean | 137.00619 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 1.3532223 |
| Sum | 7.8285446 × 10⁸ |
| Variance | 5507.2832 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 47441 | 0.8% |
| 79 | 47049 | 0.8% |
| 81 | 46966 | 0.8% |
| 82 | 46679 | 0.8% |
| 78 | 46287 | 0.8% |
| 77 | 46142 | 0.8% |
| 76 | 46041 | 0.8% |
| 83 | 45659 | 0.8% |
| 84 | 45619 | 0.8% |
| 75 | 45312 | 0.8% |
| Other values (702) | 5250813 | |
| (Missing) | 105071 | 1.8% |
| Value | Count | Frequency (%) |
| 14 | 3 | < 0.1% |
| 15 | 9 | < 0.1% |
| 16 | 29 | < 0.1% |
| 17 | 46 | |
| 18 | 58 | |
| 19 | 56 | |
| 20 | 39 | |
| 21 | 70 | |
| 22 | 69 | |
| 23 | 80 |
| Value | Count | Frequency (%) |
| 766 | 1 | |
| 735 | 1 | |
| 733 | 1 | |
| 731 | 1 | |
| 730 | 1 | |
| 727 | 1 | |
| 726 | 1 | |
| 724 | 1 | |
| 721 | 1 | |
| 719 | 1 |
AIR_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 675 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 105071 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.51163 |
| Minimum | 7 |
|---|---|
| Maximum | 690 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 60 |
| median | 94 |
| Q3 | 144 |
| 95-th percentile | 273 |
| Maximum | 690 |
| Range | 683 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 72.230822 |
|---|---|
| Coefficient of variation (CV) | 0.63632971 |
| Kurtosis | 2.0956915 |
| Mean | 113.51163 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 1.3783417 |
| Sum | 6.4860635 × 10⁸ |
| Variance | 5217.2916 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 64 | 49791 | 0.9% |
| 63 | 49760 | 0.9% |
| 62 | 49476 | 0.9% |
| 65 | 49393 | 0.8% |
| 61 | 49215 | 0.8% |
| 43 | 48785 | 0.8% |
| 60 | 48736 | 0.8% |
| 59 | 48405 | 0.8% |
| 66 | 48334 | 0.8% |
| 44 | 48295 | 0.8% |
| Other values (665) | 5223818 | |
| (Missing) | 105071 | 1.8% |
| Value | Count | Frequency (%) |
| 7 | 7 | < 0.1% |
| 8 | 68 | < 0.1% |
| 9 | 134 | < 0.1% |
| 10 | 128 | < 0.1% |
| 11 | 112 | < 0.1% |
| 12 | 88 | < 0.1% |
| 13 | 208 | < 0.1% |
| 14 | 549 | < 0.1% |
| 15 | 1118 | |
| 16 | 2007 |
| Value | Count | Frequency (%) |
| 690 | 2 | |
| 687 | 2 | |
| 684 | 1 | |
| 683 | 2 | |
| 682 | 1 | |
| 679 | 1 | |
| 678 | 1 | |
| 676 | 1 | |
| 674 | 1 | |
| 672 | 1 |
DISTANCE
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1363 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 822.35649 |
| Minimum | 21 |
|---|---|
| Maximum | 4983 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 173 |
| Q1 | 373 |
| median | 647 |
| Q3 | 1062 |
| 95-th percentile | 2227 |
| Maximum | 4983 |
| Range | 4962 |
| Interquartile range (IQR) | 689 |
Descriptive statistics
| Standard deviation | 607.78429 |
|---|---|
| Coefficient of variation (CV) | 0.73907641 |
| Kurtosis | 2.2473611 |
| Mean | 822.35649 |
| Median Absolute Deviation (MAD) | 322 |
| Skewness | 1.4224682 |
| Sum | 4.7853574 × 10⁹ |
| Variance | 369401.74 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 337 | 50069 | 0.9% |
| 447 | 28096 | 0.5% |
| 594 | 27629 | 0.5% |
| 404 | 27429 | 0.5% |
| 2475 | 26219 | 0.5% |
| 867 | 25496 | 0.4% |
| 399 | 25118 | 0.4% |
| 862 | 23799 | 0.4% |
| 214 | 23454 | 0.4% |
| 236 | 22316 | 0.4% |
| Other values (1353) | 5539454 |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 31 | 726 | < 0.1% |
| 36 | 1 | < 0.1% |
| 41 | 154 | < 0.1% |
| 49 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 62 | 2 | < 0.1% |
| 67 | 11177 | |
| 68 | 1524 | < 0.1% |
| 69 | 991 | < 0.1% |
| Value | Count | Frequency (%) |
| 4983 | 682 | |
| 4962 | 722 | |
| 4817 | 344 | < 0.1% |
| 4502 | 729 | |
| 4243 | 730 | |
| 4184 | 238 | < 0.1% |
| 3972 | 94 | < 0.1% |
| 3904 | 730 | |
| 3801 | 730 | |
| 3784 | 1518 |
WHEELS_ON
Date
MISSING 
| Distinct | 1381 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92513 |
| Missing (%) | 1.6% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
TAXI_IN
Real number (ℝ)
MISSING 
| Distinct | 185 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92513 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4349708 |
| Minimum | 1 |
|---|---|
| Maximum | 248 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 16 |
| Maximum | 248 |
| Range | 247 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 5.6385477 |
|---|---|
| Coefficient of variation (CV) | 0.75838195 |
| Kurtosis | 58.922996 |
| Mean | 7.4349708 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.1297319 |
| Sum | 42576851 |
| Variance | 31.79322 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 932909 | |
| 4 | 923558 | |
| 6 | 759134 | |
| 7 | 567620 | |
| 3 | 524797 | |
| 8 | 423947 | |
| 9 | 316600 | 5.4% |
| 10 | 243087 | 4.2% |
| 11 | 182533 | 3.1% |
| 12 | 139532 | 2.4% |
| Other values (175) | 712849 |
| Value | Count | Frequency (%) |
| 1 | 5695 | 0.1% |
| 2 | 117453 | 2.0% |
| 3 | 524797 | |
| 4 | 923558 | |
| 5 | 932909 | |
| 6 | 759134 | |
| 7 | 567620 | |
| 8 | 423947 | |
| 9 | 316600 | 5.4% |
| 10 | 243087 | 4.2% |
| Value | Count | Frequency (%) |
| 248 | 1 | |
| 202 | 1 | |
| 197 | 1 | |
| 184 | 1 | |
| 183 | 1 | |
| 181 | 1 | |
| 180 | 1 | |
| 179 | 1 | |
| 178 | 1 | |
| 177 | 1 |
| Distinct | 1376 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
ARRIVAL_TIME
Date
MISSING 
| Distinct | 1381 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92513 |
| Missing (%) | 1.6% |
| Memory size | 44.4 MiB |
| Minimum | 2023-12-17 00:00:00 |
|---|---|
| Maximum | 2023-12-17 23:59:00 |
ARRIVAL_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1240 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 105071 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4070574 |
| Minimum | -87 |
|---|---|
| Maximum | 1971 |
| Zeros | 126213 |
| Zeros (%) | 2.2% |
| Negative | 3500899 |
| Negative (%) | 60.2% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | -87 |
|---|---|
| 5-th percentile | -25 |
| Q1 | -13 |
| median | -5 |
| Q3 | 8 |
| 95-th percentile | 66 |
| Maximum | 1971 |
| Range | 2058 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 39.271297 |
|---|---|
| Coefficient of variation (CV) | 8.911002 |
| Kurtosis | 97.739803 |
| Mean | 4.4070574 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 6.5028962 |
| Sum | 25181961 |
| Variance | 1542.2348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -8 | 176899 | 3.0% |
| -9 | 176016 | 3.0% |
| -10 | 175232 | 3.0% |
| -7 | 174524 | 3.0% |
| -11 | 171557 | 2.9% |
| -6 | 169411 | 2.9% |
| -12 | 165214 | 2.8% |
| -5 | 164176 | 2.8% |
| -13 | 158464 | 2.7% |
| -4 | 157472 | 2.7% |
| Other values (1230) | 4025043 |
| Value | Count | Frequency (%) |
| -87 | 2 | < 0.1% |
| -82 | 1 | < 0.1% |
| -81 | 2 | < 0.1% |
| -80 | 3 | < 0.1% |
| -79 | 2 | < 0.1% |
| -78 | 1 | < 0.1% |
| -77 | 2 | < 0.1% |
| -76 | 3 | < 0.1% |
| -75 | 1 | < 0.1% |
| -74 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1971 | 1 | < 0.1% |
| 1898 | 1 | < 0.1% |
| 1665 | 1 | < 0.1% |
| 1638 | 1 | < 0.1% |
| 1636 | 2 | < 0.1% |
| 1627 | 1 | < 0.1% |
| 1598 | 1 | < 0.1% |
| 1593 | 1 | < 0.1% |
| 1576 | 1 | < 0.1% |
| 1574 | 1 | < 0.1% |
DIVERTED
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| 0 | 5803892 |
|---|---|
| 1 | 15187 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5819079 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5819079 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5819079 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5819079 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5803892 | 99.7% |
| 1 | 15187 | 0.3% |
CANCELLED
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| 0 | 5729195 |
|---|---|
| 1 | 89884 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5819079 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5819079 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5819079 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5819079 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5729195 | 98.5% |
| 1 | 89884 | 1.5% |
CANCELLATION_REASON
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5729195 |
| Missing (%) | 98.5% |
| Memory size | 44.4 MiB |
| B | 48851 |
|---|---|
| A | 25262 |
| C | 15749 |
| D | 22 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 89884 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | B |
| 3rd row | B |
| 4th row | B |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| B | 48851 | 0.8% |
| A | 25262 | 0.4% |
| C | 15749 | 0.3% |
| D | 22 | < 0.1% |
| (Missing) | 5729195 | 98.5% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 48851 | 54.3% |
| a | 25262 | 28.1% |
| c | 15749 | 17.5% |
| d | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 48851 | 54.3% |
| A | 25262 | 28.1% |
| C | 15749 | 17.5% |
| D | 22 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 89884 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 48851 | 54.3% |
| A | 25262 | 28.1% |
| C | 15749 | 17.5% |
| D | 22 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 89884 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 48851 | 54.3% |
| A | 25262 | 28.1% |
| C | 15749 | 17.5% |
| D | 22 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 89884 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 48851 | 54.3% |
| A | 25262 | 28.1% |
| C | 15749 | 17.5% |
| D | 22 | < 0.1% |
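The CANCELLATION_REASON tables above show two different percentages for the same count (for example, C appears as 0.3% in Common Values and as 17.5% in the character tables). The numbers are consistent with the first being a share of all rows and the second a share of non-missing values only. That reading is inferred from the figures, not stated by the report. A minimal Python sketch of the comparison, with the helper name `pct` chosen here for illustration:

```python
# Counts copied from the CANCELLATION_REASON tables of this report.
n_rows = 5819079
counts = {"B": 48851, "A": 25262, "C": 15749, "D": 22}

n_present = sum(counts.values())   # rows with a recorded reason
n_missing = n_rows - n_present     # rows without one


def pct(part, whole):
    """Percentage rounded to one decimal place, as the report displays it."""
    return round(100 * part / whole, 1)


# For each reason: share of all rows vs. share of non-missing values.
for reason, count in counts.items():
    print(reason, pct(count, n_rows), pct(count, n_present))
```

When comparing frequencies across columns with many missing values, it helps to check which denominator a given table uses before drawing conclusions.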
AIR_SYSTEM_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 570 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4755640 |
| Missing (%) | 81.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.480568 |
| Minimum | 0 |
|---|---|
| Maximum | 1134 |
| Zeros | 498613 |
| Zeros (%) | 8.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 18 |
| 95-th percentile | 56 |
| Maximum | 1134 |
| Range | 1134 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 28.003679 |
|---|---|
| Coefficient of variation (CV) | 2.0773367 |
| Kurtosis | 71.529141 |
| Mean | 13.480568 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.0267533 |
| Sum | 14335762 |
| Variance | 784.20603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 498613 | 8.6% |
| 1 | 28003 | 0.5% |
| 15 | 23199 | 0.4% |
| 2 | 22981 | 0.4% |
| 3 | 21446 | 0.4% |
| 16 | 21357 | 0.4% |
| 4 | 20305 | 0.3% |
| 17 | 18738 | 0.3% |
| 5 | 18737 | 0.3% |
| 6 | 17671 | 0.3% |
| Other values (560) | 372389 | 6.4% |
| (Missing) | 4755640 | 81.7% |
| Value | Count | Frequency (%) |
| 0 | 498613 | 8.6% |
| 1 | 28003 | 0.5% |
| 2 | 22981 | 0.4% |
| 3 | 21446 | 0.4% |
| 4 | 20305 | 0.3% |
| 5 | 18737 | 0.3% |
| 6 | 17671 | 0.3% |
| 7 | 16582 | 0.3% |
| 8 | 15644 | 0.3% |
| 9 | 14716 | 0.3% |
| Value | Count | Frequency (%) |
| 1134 | 1 | < 0.1% |
| 1101 | 1 | < 0.1% |
| 1049 | 1 | < 0.1% |
| 991 | 1 | < 0.1% |
| 916 | 1 | < 0.1% |
| 888 | 1 | < 0.1% |
| 872 | 1 | < 0.1% |
| 862 | 1 | < 0.1% |
| 855 | 1 | < 0.1% |
| 850 | 1 | < 0.1% |
SECURITY_DELAY
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 154 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4755640 |
| Missing (%) | 81.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.076153874 |
| Minimum | 0 |
|---|---|
| Maximum | 573 |
| Zeros | 1059955 |
| Zeros (%) | 18.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 573 |
| Range | 573 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.1434596 |
|---|---|
| Coefficient of variation (CV) | 28.146428 |
| Kurtosis | 10141.818 |
| Mean | 0.076153874 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 72.127661 |
| Sum | 80985 |
| Variance | 4.5944189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1059955 | 18.2% |
| 15 | 158 | < 0.1% |
| 8 | 127 | < 0.1% |
| 10 | 125 | < 0.1% |
| 12 | 124 | < 0.1% |
| 6 | 121 | < 0.1% |
| 13 | 119 | < 0.1% |
| 7 | 119 | < 0.1% |
| 9 | 117 | < 0.1% |
| 5 | 116 | < 0.1% |
| Other values (144) | 2358 | < 0.1% |
| (Missing) | 4755640 | 81.7% |
| Value | Count | Frequency (%) |
| 0 | 1059955 | 18.2% |
| 1 | 99 | < 0.1% |
| 2 | 110 | < 0.1% |
| 3 | 104 | < 0.1% |
| 4 | 106 | < 0.1% |
| 5 | 116 | < 0.1% |
| 6 | 121 | < 0.1% |
| 7 | 119 | < 0.1% |
| 8 | 127 | < 0.1% |
| 9 | 117 | < 0.1% |
| Value | Count | Frequency (%) |
| 573 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 364 | 1 | < 0.1% |
| 256 | 1 | < 0.1% |
| 241 | 1 | < 0.1% |
| 237 | 1 | < 0.1% |
| 227 | 2 | < 0.1% |
| 221 | 3 | < 0.1% |
| 215 | 1 | < 0.1% |
| 214 | 1 | < 0.1% |
AIRLINE_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1067 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4755640 |
| Missing (%) | 81.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.969547 |
| Minimum | 0 |
|---|---|
| Maximum | 1971 |
| Zeros | 493417 |
| Zeros (%) | 8.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 19 |
| 95-th percentile | 87 |
| Maximum | 1971 |
| Range | 1971 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 48.161642 |
|---|---|
| Coefficient of variation (CV) | 2.5388926 |
| Kurtosis | 134.85064 |
| Mean | 18.969547 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 8.5250098 |
| Sum | 20172956 |
| Variance | 2319.5438 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 493417 | 8.5% |
| 1 | 21319 | 0.4% |
| 2 | 21211 | 0.4% |
| 3 | 20656 | 0.4% |
| 4 | 20184 | 0.3% |
| 6 | 20107 | 0.3% |
| 5 | 19772 | 0.3% |
| 7 | 18646 | 0.3% |
| 8 | 17494 | 0.3% |
| 15 | 16582 | 0.3% |
| Other values (1057) | 394051 | 6.8% |
| (Missing) | 4755640 | 81.7% |
| Value | Count | Frequency (%) |
| 0 | 493417 | 8.5% |
| 1 | 21319 | 0.4% |
| 2 | 21211 | 0.4% |
| 3 | 20656 | 0.4% |
| 4 | 20184 | 0.3% |
| 5 | 19772 | 0.3% |
| 6 | 20107 | 0.3% |
| 7 | 18646 | 0.3% |
| 8 | 17494 | 0.3% |
| 9 | 16368 | 0.3% |
| Value | Count | Frequency (%) |
| 1971 | 1 | < 0.1% |
| 1878 | 1 | < 0.1% |
| 1665 | 1 | < 0.1% |
| 1636 | 1 | < 0.1% |
| 1631 | 1 | < 0.1% |
| 1625 | 1 | < 0.1% |
| 1593 | 1 | < 0.1% |
| 1587 | 1 | < 0.1% |
| 1576 | 1 | < 0.1% |
| 1563 | 1 | < 0.1% |
LATE_AIRCRAFT_DELAY
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 695 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4755640 |
| Missing (%) | 81.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.472838 |
| Minimum | 0 |
|---|---|
| Maximum | 1331 |
| Zeros | 506486 |
| Zeros (%) | 8.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 29 |
| 95-th percentile | 107 |
| Maximum | 1331 |
| Range | 1331 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 43.197018 |
|---|---|
| Coefficient of variation (CV) | 1.8402981 |
| Kurtosis | 32.075947 |
| Mean | 23.472838 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 4.0166724 |
| Sum | 24961931 |
| Variance | 1865.9824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 506486 | 8.7% |
| 15 | 14522 | 0.2% |
| 16 | 13824 | 0.2% |
| 17 | 12908 | 0.2% |
| 18 | 12259 | 0.2% |
| 19 | 11794 | 0.2% |
| 14 | 11183 | 0.2% |
| 20 | 11079 | 0.2% |
| 13 | 10930 | 0.2% |
| 11 | 10517 | 0.2% |
| Other values (685) | 447937 | 7.7% |
| (Missing) | 4755640 | 81.7% |
| Value | Count | Frequency (%) |
| 0 | 506486 | 8.7% |
| 1 | 9575 | 0.2% |
| 2 | 9388 | 0.2% |
| 3 | 9012 | 0.2% |
| 4 | 8854 | 0.2% |
| 5 | 9038 | 0.2% |
| 6 | 9501 | 0.2% |
| 7 | 9629 | 0.2% |
| 8 | 9912 | 0.2% |
| 9 | 9897 | 0.2% |
| Value | Count | Frequency (%) |
| 1331 | 1 | < 0.1% |
| 1313 | 1 | < 0.1% |
| 1294 | 1 | < 0.1% |
| 1256 | 1 | < 0.1% |
| 1190 | 1 | < 0.1% |
| 1174 | 1 | < 0.1% |
| 1164 | 1 | < 0.1% |
| 1102 | 1 | < 0.1% |
| 1057 | 1 | < 0.1% |
| 1039 | 1 | < 0.1% |
WEATHER_DELAY
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 632 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4755640 |
| Missing (%) | 81.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9152899 |
| Minimum | 0 |
|---|---|
| Maximum | 1211 |
| Zeros | 998723 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 8 |
| Maximum | 1211 |
| Range | 1211 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 20.433336 |
|---|---|
| Coefficient of variation (CV) | 7.0090235 |
| Kurtosis | 451.69916 |
| Mean | 2.9152899 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.308217 |
| Sum | 3100233 |
| Variance | 417.52121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 998723 | 17.2% |
| 6 | 1649 | < 0.1% |
| 8 | 1580 | < 0.1% |
| 7 | 1537 | < 0.1% |
| 15 | 1498 | < 0.1% |
| 10 | 1498 | < 0.1% |
| 9 | 1487 | < 0.1% |
| 16 | 1460 | < 0.1% |
| 5 | 1415 | < 0.1% |
| 3 | 1412 | < 0.1% |
| Other values (622) | 51180 | 0.9% |
| (Missing) | 4755640 | 81.7% |
| Value | Count | Frequency (%) |
| 0 | 998723 | 17.2% |
| 1 | 1308 | < 0.1% |
| 2 | 1397 | < 0.1% |
| 3 | 1412 | < 0.1% |
| 4 | 1319 | < 0.1% |
| 5 | 1415 | < 0.1% |
| 6 | 1649 | < 0.1% |
| 7 | 1537 | < 0.1% |
| 8 | 1580 | < 0.1% |
| 9 | 1487 | < 0.1% |
| Value | Count | Frequency (%) |
| 1211 | 1 | < 0.1% |
| 1152 | 1 | < 0.1% |
| 1120 | 1 | < 0.1% |
| 1118 | 1 | < 0.1% |
| 1116 | 1 | < 0.1% |
| 1068 | 1 | < 0.1% |
| 1052 | 1 | < 0.1% |
| 1039 | 1 | < 0.1% |
| 1035 | 1 | < 0.1% |
| 1021 | 1 | < 0.1% |
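The five delay-cause columns above (AIR_SYSTEM_DELAY, SECURITY_DELAY, AIRLINE_DELAY, LATE_AIRCRAFT_DELAY, WEATHER_DELAY) all report the same number of missing values, 4755640. One way to confirm that each reported mean is taken over the remaining rows only is to divide the reported Sum by that non-missing count. The sketch below does this in plain Python with values copied from the tables. It is a consistency check under that assumption, not a description of how the profiling tool computes its statistics.

```python
n_rows = 5819079
n_missing = 4755640              # identical for all five delay-cause columns
n_present = n_rows - n_missing   # rows where a delay cause was recorded

# column name -> (reported Sum, reported Mean), copied from the tables above
reported = {
    "AIR_SYSTEM_DELAY": (14335762, 13.480568),
    "SECURITY_DELAY": (80985, 0.076153874),
    "AIRLINE_DELAY": (20172956, 18.969547),
    "LATE_AIRCRAFT_DELAY": (24961931, 23.472838),
    "WEATHER_DELAY": (3100233, 2.9152899),
}

# Recompute each mean as Sum divided by the non-missing row count.
recomputed = {name: total / n_present for name, (total, _) in reported.items()}

for name, (_, mean) in reported.items():
    print(f"{name}: reported {mean:.4f}, recomputed {recomputed[name]:.4f}")
```

Because these means describe only the roughly 18% of flights with a recorded delay cause, they should not be read as averages over all flights.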
LATE_AIRCRAFT_DELAY_CAT
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| 0 | 5414697 |
|---|---|
| 1 | 147751 |
| 2 | 128482 |
| 3 | 128149 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5819079 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5819079 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5819079 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5819079 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5414697 | 93.1% |
| 1 | 147751 | 2.5% |
| 2 | 128482 | 2.2% |
| 3 | 128149 | 2.2% |
Date
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| Minimum | 2015-01-01 00:00:00 |
|---|---|
| Maximum | 2015-12-31 00:00:00 |
| | AIRLINE | AIRLINE_DELAY | AIR_SYSTEM_DELAY | AIR_TIME | ARRIVAL_DELAY | CANCELLATION_REASON | CANCELLED | DAY | DAY_OF_WEEK | DEPARTURE_DELAY | DISTANCE | DIVERTED | ELAPSED_TIME | FLIGHT_NUMBER | LATE_AIRCRAFT_DELAY | LATE_AIRCRAFT_DELAY_CAT | MONTH | SECURITY_DELAY | TAXI_IN | TAXI_OUT | WEATHER_DELAY |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AIRLINE | 1.000 | 0.015 | -0.119 | -0.097 | 0.060 | 0.237 | 0.084 | -0.001 | -0.007 | 0.119 | -0.090 | 0.009 | -0.131 | 0.074 | 0.119 | 0.046 | -0.068 | -0.012 | -0.108 | -0.265 | -0.032 |
| AIRLINE_DELAY | 0.015 | 1.000 | -0.362 | 0.051 | 0.190 | 0.000 | 1.000 | 0.007 | 0.033 | 0.304 | 0.078 | 1.000 | -0.005 | -0.057 | -0.213 | 0.037 | 0.006 | -0.049 | -0.120 | -0.132 | -0.220 |
| AIR_SYSTEM_DELAY | -0.119 | -0.362 | 1.000 | 0.110 | 0.018 | 0.000 | 1.000 | -0.015 | -0.016 | -0.392 | 0.034 | 1.000 | 0.270 | 0.001 | -0.355 | 0.036 | -0.029 | -0.009 | 0.289 | 0.487 | -0.000 |
| AIR_TIME | -0.097 | 0.051 | 0.110 | 1.000 | -0.034 | 0.000 | 1.000 | 0.002 | 0.014 | 0.085 | 0.988 | 1.000 | 0.983 | -0.305 | -0.128 | 0.018 | 0.001 | 0.010 | 0.126 | 0.105 | -0.003 |
| ARRIVAL_DELAY | 0.060 | 0.190 | 0.018 | -0.034 | 1.000 | 0.000 | 1.000 | -0.008 | -0.018 | 0.641 | -0.065 | 1.000 | 0.024 | 0.015 | 0.363 | 0.267 | -0.061 | -0.013 | 0.098 | 0.250 | 0.135 |
| CANCELLATION_REASON | 0.237 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | -0.024 | -0.043 | 0.068 | -0.061 | 1.000 | NaN | 0.189 | NaN | 1.000 | -0.063 | NaN | NaN | 0.115 | NaN |
| CANCELLED | 0.084 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | -0.003 | -0.023 | 0.019 | -0.036 | 0.006 | NaN | 0.038 | NaN | 0.034 | -0.055 | NaN | NaN | 0.004 | NaN |
| DAY | -0.001 | 0.007 | -0.015 | 0.002 | -0.008 | -0.024 | -0.003 | 1.000 | 0.001 | -0.004 | 0.004 | 0.005 | 0.002 | 0.002 | 0.000 | 0.011 | 0.009 | 0.004 | -0.001 | -0.003 | 0.008 |
| DAY_OF_WEEK | -0.007 | 0.033 | -0.016 | 0.014 | -0.018 | -0.043 | -0.023 | 0.001 | 1.000 | -0.004 | 0.017 | 0.005 | 0.011 | 0.017 | -0.020 | 0.015 | -0.008 | 0.006 | -0.000 | -0.018 | -0.012 |
| DEPARTURE_DELAY | 0.119 | 0.304 | -0.392 | 0.085 | 0.641 | 0.068 | 0.019 | -0.004 | -0.004 | 1.000 | 0.092 | 0.016 | 0.085 | -0.059 | 0.479 | 0.264 | -0.031 | -0.008 | -0.047 | 0.031 | 0.111 |
| DISTANCE | -0.090 | 0.078 | 0.034 | 0.988 | -0.065 | -0.061 | -0.036 | 0.004 | 0.017 | 0.092 | 1.000 | 0.015 | 0.967 | -0.321 | -0.099 | 0.018 | 0.009 | 0.011 | 0.111 | 0.092 | -0.006 |
| DIVERTED | 0.009 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.006 | 0.005 | 0.005 | 0.016 | 0.015 | 1.000 | NaN | 0.003 | NaN | 0.014 | -0.001 | NaN | 0.014 | 0.008 | NaN |
| ELAPSED_TIME | -0.131 | -0.005 | 0.270 | 0.983 | 0.024 | NaN | NaN | 0.002 | 0.011 | 0.085 | 0.967 | NaN | 1.000 | -0.285 | -0.190 | 0.016 | -0.000 | 0.008 | 0.193 | 0.224 | 0.018 |
| FLIGHT_NUMBER | 0.074 | -0.057 | 0.001 | -0.305 | 0.015 | 0.189 | 0.038 | 0.002 | 0.017 | -0.059 | -0.321 | 0.003 | -0.285 | 1.000 | 0.043 | 0.017 | -0.014 | -0.018 | -0.026 | 0.064 | -0.005 |
| LATE_AIRCRAFT_DELAY | 0.119 | -0.213 | -0.355 | -0.128 | 0.363 | NaN | NaN | 0.000 | -0.020 | 0.479 | -0.099 | NaN | -0.190 | 0.043 | 1.000 | 0.280 | -0.006 | -0.018 | -0.092 | -0.234 | -0.015 |
| LATE_AIRCRAFT_DELAY_CAT | 0.046 | 0.037 | 0.036 | 0.018 | 0.267 | 1.000 | 0.034 | 0.011 | 0.015 | 0.264 | 0.018 | 0.014 | 0.016 | 0.017 | 0.280 | 1.000 | -0.025 | -0.018 | -0.001 | 0.007 | -0.013 |
| MONTH | -0.068 | 0.006 | -0.029 | 0.001 | -0.061 | -0.063 | -0.055 | 0.009 | -0.008 | -0.031 | 0.009 | -0.001 | -0.000 | -0.014 | -0.006 | -0.025 | 1.000 | 0.012 | 0.014 | 0.002 | -0.028 |
| SECURITY_DELAY | -0.012 | -0.049 | -0.009 | 0.010 | -0.013 | NaN | NaN | 0.004 | 0.006 | -0.008 | 0.011 | NaN | 0.008 | -0.018 | -0.018 | -0.018 | 0.012 | 1.000 | -0.006 | -0.003 | -0.013 |
| TAXI_IN | -0.108 | -0.120 | 0.289 | 0.126 | 0.098 | NaN | NaN | -0.001 | -0.000 | -0.047 | 0.111 | 0.014 | 0.193 | -0.026 | -0.092 | -0.001 | 0.014 | -0.006 | 1.000 | 0.011 | -0.009 |
| TAXI_OUT | -0.265 | -0.132 | 0.487 | 0.105 | 0.250 | 0.115 | 0.004 | -0.003 | -0.018 | 0.031 | 0.092 | 0.008 | 0.224 | 0.064 | -0.234 | 0.007 | 0.002 | -0.003 | 0.011 | 1.000 | 0.086 |
| WEATHER_DELAY | -0.032 | -0.220 | -0.000 | -0.003 | 0.135 | NaN | NaN | 0.008 | -0.012 | 0.111 | -0.006 | NaN | 0.018 | -0.005 | -0.015 | -0.013 | -0.028 | -0.013 | -0.009 | 0.086 | 1.000 |
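To read the correlation table above, it can help to pull out only the strongly related pairs. The sketch below does this in plain Python for a handful of coefficients copied from the table. The cut-off of 0.5 is an assumption chosen for illustration and is not stated anywhere in this report.

```python
# A few coefficients copied from the correlation table above.
pairs = {
    ("AIR_TIME", "DISTANCE"): 0.988,
    ("AIR_TIME", "ELAPSED_TIME"): 0.983,
    ("DISTANCE", "ELAPSED_TIME"): 0.967,
    ("ARRIVAL_DELAY", "DEPARTURE_DELAY"): 0.641,
    ("AIR_SYSTEM_DELAY", "TAXI_OUT"): 0.487,
    ("DAY", "MONTH"): 0.009,
}

THRESHOLD = 0.5  # assumed cut-off for "strong", not taken from the report

# Keep pairs whose absolute coefficient reaches the threshold, strongest first.
strong = sorted(
    (pair for pair, r in pairs.items() if abs(r) >= THRESHOLD),
    key=lambda pair: -abs(pairs[pair]),
)

for a, b in strong:
    print(f"{a} ~ {b}: {pairs[(a, b)]:+.3f}")
```

With this cut-off, the three flight-length measures (AIR_TIME, DISTANCE, ELAPSED_TIME) and the departure/arrival delay pair are retained, while TAXI_OUT versus AIR_SYSTEM_DELAY (0.487) falls just below it.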
| | YEAR | MONTH | DAY | DAY_OF_WEEK | AIRLINE | FLIGHT_NUMBER | TAIL_NUMBER | ORIGIN_AIRPORT | DESTINATION_AIRPORT | SCHEDULED_DEPARTURE | DEPARTURE_TIME | DEPARTURE_DELAY | TAXI_OUT | WHEELS_OFF | SCHEDULED_TIME | ELAPSED_TIME | AIR_TIME | DISTANCE | WHEELS_ON | TAXI_IN | SCHEDULED_ARRIVAL | ARRIVAL_TIME | ARRIVAL_DELAY | DIVERTED | CANCELLED | CANCELLATION_REASON | AIR_SYSTEM_DELAY | SECURITY_DELAY | AIRLINE_DELAY | LATE_AIRCRAFT_DELAY | WEATHER_DELAY | LATE_AIRCRAFT_DELAY_CAT | Date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015 | 1 | 1 | 4 | AS | 98 | N407AS | ANC | SEA | 00:05 AM | 11:54 PM | -11.0 | 21.0 | 00:15 AM | 02:05 AM | 194.0 | 169.0 | 1448 | 04:04 AM | 4.0 | 04:30 AM | 04:08 AM | -22.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 1 | 2015 | 1 | 1 | 4 | AA | 2336 | N3KUAA | LAX | PBI | 00:10 AM | 00:02 AM | -8.0 | 12.0 | 00:14 AM | 02:80 AM | 279.0 | 263.0 | 2330 | 07:37 AM | 4.0 | 07:50 AM | 07:41 AM | -9.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 2 | 2015 | 1 | 1 | 4 | US | 840 | N171US | SFO | CLT | 00:20 AM | 00:18 AM | -2.0 | 16.0 | 00:34 AM | 02:86 AM | 293.0 | 266.0 | 2296 | 08:00 AM | 11.0 | 08:06 AM | 08:11 AM | 5.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 3 | 2015 | 1 | 1 | 4 | AA | 258 | N3HYAA | LAX | MIA | 00:20 AM | 00:15 AM | -5.0 | 15.0 | 00:30 AM | 02:85 AM | 281.0 | 258.0 | 2342 | 07:48 AM | 8.0 | 08:05 AM | 07:56 AM | -9.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 4 | 2015 | 1 | 1 | 4 | AS | 135 | N527AS | SEA | ANC | 00:25 AM | 00:24 AM | -1.0 | 11.0 | 00:35 AM | 02:35 AM | 215.0 | 199.0 | 1448 | 02:54 AM | 5.0 | 03:20 AM | 02:59 AM | -21.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 5 | 2015 | 1 | 1 | 4 | DL | 806 | N3730B | SFO | MSP | 00:25 AM | 00:20 AM | -5.0 | 18.0 | 00:38 AM | 02:17 AM | 230.0 | 206.0 | 1589 | 06:04 AM | 6.0 | 06:02 AM | 06:10 AM | 8.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 6 | 2015 | 1 | 1 | 4 | NK | 612 | N635NK | LAS | MSP | 00:25 AM | 00:19 AM | -6.0 | 11.0 | 00:30 AM | 01:81 AM | 170.0 | 154.0 | 1299 | 05:04 AM | 5.0 | 05:26 AM | 05:09 AM | -17.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 7 | 2015 | 1 | 1 | 4 | US | 2013 | N584UW | LAX | CLT | 00:30 AM | 00:44 AM | 14.0 | 13.0 | 00:57 AM | 02:73 AM | 249.0 | 228.0 | 2125 | 07:45 AM | 8.0 | 08:03 AM | 07:53 AM | -10.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 8 | 2015 | 1 | 1 | 4 | AA | 1112 | N3LAAA | SFO | DFW | 00:30 AM | 00:19 AM | -11.0 | 17.0 | 00:36 AM | 01:95 AM | 193.0 | 173.0 | 1464 | 05:29 AM | 3.0 | 05:45 AM | 05:32 AM | -13.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| 9 | 2015 | 1 | 1 | 4 | DL | 1173 | N826DN | LAS | ATL | 00:30 AM | 00:33 AM | 3.0 | 12.0 | 00:45 AM | 02:21 AM | 203.0 | 186.0 | 1747 | 06:51 AM | 5.0 | 07:11 AM | 06:56 AM | -15.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 01/01/2015 |
| | YEAR | MONTH | DAY | DAY_OF_WEEK | AIRLINE | FLIGHT_NUMBER | TAIL_NUMBER | ORIGIN_AIRPORT | DESTINATION_AIRPORT | SCHEDULED_DEPARTURE | DEPARTURE_TIME | DEPARTURE_DELAY | TAXI_OUT | WHEELS_OFF | SCHEDULED_TIME | ELAPSED_TIME | AIR_TIME | DISTANCE | WHEELS_ON | TAXI_IN | SCHEDULED_ARRIVAL | ARRIVAL_TIME | ARRIVAL_DELAY | DIVERTED | CANCELLED | CANCELLATION_REASON | AIR_SYSTEM_DELAY | SECURITY_DELAY | AIRLINE_DELAY | LATE_AIRCRAFT_DELAY | WEATHER_DELAY | LATE_AIRCRAFT_DELAY_CAT | Date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5819069 | 2015 | 12 | 31 | 4 | B6 | 1248 | N948JB | LAS | JFK | 11:59 PM | 02:38 AM | 159.0 | 34.0 | 03:12 AM | 02:82 AM | 282.0 | 243.0 | 2248 | 10:15 AM | 5.0 | 07:41 AM | 10:20 AM | 159.0 | 0 | 0 | NaN | 0.0 | 0.0 | 159.0 | 0.0 | 0.0 | 0 | 31/12/2015 |
| 5819070 | 2015 | 12 | 31 | 4 | B6 | 80 | N584JB | RNO | JFK | 11:59 PM | 11:59 PM | 0.0 | 12.0 | 00:11 AM | 03:06 AM | 285.0 | 268.0 | 2411 | 07:39 AM | 5.0 | 08:05 AM | 07:44 AM | -21.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819071 | 2015 | 12 | 31 | 4 | B6 | 802 | N589JB | SLC | MCO | 11:59 PM | 00:15 AM | 16.0 | 14.0 | 00:29 AM | 02:49 AM | 250.0 | 211.0 | 1931 | 06:00 AM | 25.0 | 06:08 AM | 06:25 AM | 17.0 | 0 | 0 | NaN | 1.0 | 0.0 | 16.0 | 0.0 | 0.0 | 0 | 31/12/2015 |
| 5819072 | 2015 | 12 | 31 | 4 | B6 | 98 | N607JB | DEN | JFK | 11:59 PM | 00:06 AM | 7.0 | 13.0 | 00:19 AM | 02:11 AM | 193.0 | 173.0 | 1626 | 05:12 AM | 7.0 | 05:30 AM | 05:19 AM | -11.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819073 | 2015 | 12 | 31 | 4 | B6 | 66 | N655JB | ABQ | JFK | 11:59 PM | 00:15 AM | 16.0 | 9.0 | 00:24 AM | 02:27 AM | 214.0 | 190.0 | 1826 | 05:34 AM | 15.0 | 05:46 AM | 05:49 AM | 3.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819074 | 2015 | 12 | 31 | 4 | B6 | 688 | N657JB | LAX | BOS | 11:59 PM | 11:55 PM | -4.0 | 22.0 | 00:17 AM | 03:20 AM | 298.0 | 272.0 | 2611 | 07:49 AM | 4.0 | 08:19 AM | 07:53 AM | -26.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819075 | 2015 | 12 | 31 | 4 | B6 | 745 | N828JB | JFK | PSE | 11:59 PM | 11:55 PM | -4.0 | 17.0 | 00:12 AM | 02:27 AM | 215.0 | 195.0 | 1617 | 04:27 AM | 3.0 | 04:46 AM | 04:30 AM | -16.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819076 | 2015 | 12 | 31 | 4 | B6 | 1503 | N913JB | JFK | SJU | 11:59 PM | 11:50 PM | -9.0 | 17.0 | 00:07 AM | 02:21 AM | 222.0 | 197.0 | 1598 | 04:24 AM | 8.0 | 04:40 AM | 04:32 AM | -8.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819077 | 2015 | 12 | 31 | 4 | B6 | 333 | N527JB | MCO | SJU | 11:59 PM | 11:53 PM | -6.0 | 10.0 | 00:03 AM | 01:61 AM | 157.0 | 144.0 | 1189 | 03:27 AM | 3.0 | 03:40 AM | 03:30 AM | -10.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |
| 5819078 | 2015 | 12 | 31 | 4 | B6 | 839 | N534JB | JFK | BQN | 11:59 PM | 00:14 AM | 15.0 | 14.0 | 00:28 AM | 02:21 AM | 208.0 | 189.0 | 1576 | 04:37 AM | 5.0 | 04:40 AM | 04:42 AM | 2.0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 31/12/2015 |